Using Unlabelled Data To Update Classification Rules With Applications In Food Authenticity Studies

Authors

  • Nema Dean
  • Thomas Brendan Murphy
  • Gerard Downey
Abstract

A classification method is developed for settings in which both labelled and unlabelled samples are available. The classification rule is estimated from the labelled and unlabelled data together, in contrast to many classical methods, which use only the labelled data for estimation. The methodology models the data as arising from a Gaussian mixture model with parsimonious covariance structure, as is done in model-based clustering (Fraley and Raftery, 2002). A missing-data formulation of the mixture model is used, and the models are fitted using the EM and CEM algorithms. The performance of the proposed method is compared with that of model-based discriminant analysis. The methods are applied to spectra of foodstuffs recorded over the visible and near-infrared wavelength range in food authenticity studies, where the aim is to classify the foodstuffs from their spectra. The proposed method is shown to yield low misclassification rates: its correct classification rate was observed to be as much as 15% higher than that of model-based discriminant analysis.
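
The fitting idea in the abstract can be illustrated with a minimal sketch, which is not the authors' implementation. It assumes one Gaussian component per class with an unconstrained covariance matrix (the parsimonious covariance family is not reproduced), keeps the labelled samples' class memberships fixed, and updates only the unlabelled memberships. The function names, the `hard` flag (soft memberships for an EM-style update, hard assignments for a CEM-style update), the `reg` ridge term, and the synthetic two-class data are all hypothetical choices made for illustration.

```python
import numpy as np

def log_gaussian(X, mean, cov):
    """Log density of N(mean, cov) evaluated at each row of X."""
    d = X.shape[1]
    chol = np.linalg.cholesky(cov)
    sol = np.linalg.solve(chol, (X - mean).T)      # shape (d, n)
    maha = np.sum(sol ** 2, axis=0)                # squared Mahalanobis distances
    log_det = 2.0 * np.sum(np.log(np.diag(chol)))
    return -0.5 * (d * np.log(2.0 * np.pi) + log_det + maha)

def fit_semi_supervised_gmm(X_lab, y_lab, X_unl, n_classes,
                            n_iter=50, hard=False, reg=1e-6):
    """Fit one Gaussian per class using labelled and unlabelled rows.

    Assumes every class has several labelled samples. `hard=True`
    replaces the soft memberships with 0/1 assignments (CEM-style).
    """
    X = np.vstack([X_lab, X_unl])
    d = X.shape[1]
    Z_lab = np.eye(n_classes)[y_lab]               # known labels, never updated
    Z_unl = np.zeros((X_unl.shape[0], n_classes))  # first M-step: labelled data only
    for _ in range(n_iter):
        # M-step: weighted proportions, means and covariances from all rows.
        Z = np.vstack([Z_lab, Z_unl])
        nk = Z.sum(axis=0)
        weights = nk / nk.sum()
        means = (Z.T @ X) / nk[:, None]
        covs = []
        for k in range(n_classes):
            diff = X - means[k]
            cov = (Z[:, k, None] * diff).T @ diff / nk[k]
            covs.append(cov + reg * np.eye(d))     # small ridge for stability
        # E-step: posterior class memberships for the unlabelled rows only.
        log_p = np.column_stack([
            np.log(weights[k]) + log_gaussian(X_unl, means[k], covs[k])
            for k in range(n_classes)
        ])
        log_p -= log_p.max(axis=1, keepdims=True)
        Z_unl = np.exp(log_p)
        Z_unl /= Z_unl.sum(axis=1, keepdims=True)
        if hard:                                   # C-step of a CEM-style update
            Z_unl = np.eye(n_classes)[Z_unl.argmax(axis=1)]
    return weights, means, covs, Z_unl

# Synthetic illustration (not food-spectra data): two well-separated classes,
# 5 labelled and 100 unlabelled samples per class.
rng = np.random.default_rng(0)
X0 = rng.normal(0.0, 1.0, size=(105, 2))
X1 = rng.normal(5.0, 1.0, size=(105, 2))
X_lab = np.vstack([X0[:5], X1[:5]])
y_lab = np.array([0] * 5 + [1] * 5)
X_unl = np.vstack([X0[5:], X1[5:]])
y_true = np.array([0] * 100 + [1] * 100)

weights, means, covs, Z_unl = fit_semi_supervised_gmm(X_lab, y_lab, X_unl, n_classes=2)
accuracy = np.mean(Z_unl.argmax(axis=1) == y_true)
```

The unlabelled samples are classified by the largest entry in each row of `Z_unl`. Restricting the fit to the labelled rows alone (for example by setting `n_iter=1` and ignoring the unlabelled contribution) would correspond to a plain discriminant-analysis style estimate, which is the kind of baseline the abstract compares against.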


Similar resources

Variable Selection and Updating in Model-based Discriminant Analysis for High Dimensional Data with Food Authenticity Applications by Thomas

Food authenticity studies are concerned with determining if food samples have been correctly labeled or not. Discriminant analysis methods are an integral part of the methodology for food authentication. Motivated by food authenticity applications, a model-based discriminant analysis method that includes variable selection is presented. The discriminant analysis model is fitted in a semi-superv...


Variable Selection and Updating In Model-Based Discriminant Analysis for High Dimensional Data with Food Authenticity Applications.

Food authenticity studies are concerned with determining if food samples have been correctly labelled or not. Discriminant analysis methods are an integral part of the methodology for food authentication. Motivated by food authenticity applications, a model-based discriminant analysis method that includes variable selection is presented. The discriminant analysis model is fitted in a semi-super...


Detecting Concept Drift in Data Stream Using Semi-Supervised Classification

A data stream is a sequence of data generated from various information sources at high speed and in high volume. Classifying data streams faces three challenges: unlimited length, online processing, and concept drift. In related research, the challenge of unlimited stream length is commonly met by dividing the stream into fixed-size windows or by gradual forgetting. Concept drift refer...


Automatic Interpretation of UltraCam Imagery by Combination of Support Vector Machine and Knowledge-based Systems

With the development of digital sensors, an increasing number of high-resolution images are available. These images cannot be interpreted manually, which calls for practical, fast, and automatic solutions to environmental and location-based management problems. Land cover classification using high-resolution imagery is a difficult process because of the c...


The influence of event attributes on tourist’s loyalty: Evidence from the Ashoura event in Yazd City

Many studies have found that the perceived authenticity of cultural and religious events affects event satisfaction and loyalty. Little is currently known about how perceived authenticity is affected by the facilities, such as food and the availability of information, which are independent determinants of satisfaction and loyalty. This study aims to examine the antecedents of event loyalty. Que...




Publication date: 2004